Stream-learn — open-source Python library for difficult data stream batch analysis

Abstract

Stream-learn is a Python package compatible with scikit-learn and developed for the analysis of drifting and imbalanced data streams. Its main component is a stream generator, which allows producing a synthetic data stream that may incorporate each of the three main concept drift types (i.e., sudden, gradual and incremental drift) in their recurring or non-recurring version, as well as static and dynamic class imbalance. The package allows conducting experiments following established evaluation methodologies (i.e., Test-Then-Train and Prequential). Besides, estimators adapted for data stream classification have been implemented, including both simple classifiers and state-of-the-art chunk-based and online classifier ensembles. The package utilises its own implementations of prediction metrics for imbalanced binary classification tasks to improve computational efficiency.

Similar resources

pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis

Audio information plays a rather important role in the increasing digital content that is available today, resulting in a need for methodologies that automatically analyze such content: audio event recognition for home automations and surveillance systems, speech recognition, music information retrieval, multimodal analysis (e.g. audio-visual analysis of online videos for content-based recommen...

Application of “Sink & Source” and “Stream wise” Methods for Exergy Analysis of Two MED Desalination Systems

Fossil fuels are commonly used to supply the energy required by desalination systems. Solar energy, on the other hand, is a high-grade energy source that is especially abundant in hot climates. Using solar energy to operate desalination systems therefore reduces greenhouse gas emissions and is a good alternative. Common exergy analysis method (s...

Analytical Data Mining for Stream Data Analysis

The main idea behind this research relies on analytical data mining functions to handle data streams. Given the characteristics of the data stream, the new methods and techniques for stream data analysis must conduct advanced analysis and data mining over fast and large data streams to capture the trends, patterns and exceptions. Besides, much of such data resides at rather low level of abstrac...

Geospatial Data Stream Processing in Python Using Foss4g Components

One viewpoint of current and future IT systems holds that there is an increase in the scale and velocity at which data are acquired and analysed from heterogeneous, dynamic sources. In the earth observation and geoinformatics domains, this process is driven by the increase in number and types of devices that report location and the proliferation of assorted sensors, from satellite constellation...

Visual analysis of stream data

We present the DEVise toolkit designed for visual exploration of stream data. Data of this type are collected continuously from sources such as remote sensors, program traces, and the stock market. A typical application involves looking for correlations, which may not be precisely defined, by experimenting with graphical representations. This includes selectively comparing data from multiple sou...

Journal

Journal title: Neurocomputing

Year: 2022

ISSN: 0925-2312, 1872-8286

DOI: https://doi.org/10.1016/j.neucom.2021.10.120